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REGULATION OF EMBRYONIC TR.\NSCRlPTION IN PLANTS 



HELD OF THE INVENTION 

The invention is in the field of nucleic acid sequences capable of regulating 
5 transcription, paniculariy sequences that may promote transcription during embryocenesis in 
plants. 

BACKGROUND OF THE INVENTION 

Most of the information about seed-specific gene expression comes from studies of 

1 0 genes encoding seed storage proteins like napin, a major protein in the seeds of Brassica 

napus. or conglycinin of soybean. Upstream DNA sequences directing strong embryo-specific 
expression of these storage proteins have been used successfully in transgenic plants to 
manipulate seed lipid composition and accumulation ( Voelker et al.. 1996). However, 
expression of storage protein genes begins fairly late in embryogenesis. Thus, promoters of 

1 5 seed storage protein genes may not be ideal for all seed-specific applications. For example, 
storage oil accumulation commences significantly before the highest level of expression of 
either napin ( Stalberg et al.. 1 996) or conglycinin (Chen et aL, 1 988) is achieved, h is. 
therefore of interest to identif>^ other promoters which may modulate expression of genes in 
developing plant embr\'os. 

20 A variety of transcriptional regulatory regions that may he active during plant 

embn'ogenesis are known, as disclosed for example m: U.S. Patent No. 5,792.922 issued 1 1 
.August 1Q98 to Moloney: L'.S. Patent No. 5.623.067 issued 22 .\pril 1^97 to \'anGckcrckho\ c 
ct al.; Intemanonal Patent Publication \\'09845461 published 15 October i^^^^S. Tlicre remam^ 
a need for aitcmative transcriptional regulatory regions. 

25 FATTY ACID ELOSGATIOM {FAETi genes encode condensing enzymes in\oi\ ed iri 

plant \ cry long chain tatt>' acid biosynthesis. The FAEl condensing enzyme is thought to be 
localized m the endoplasmic reticulum where it catalyzes the sequentiai elongation of CIS 
lattv ac>l chains to C22 in length (Kunst et al.. 1992). FAEI genes ha\e ncen cioncd and 
descrfned receniU' b\' James et al. { 1^^5 ). International Patent Publication WO 9h 13582. 



SUMMARY OF THE INVENTION 

In one aspect, the mvention pro\'ides transcnptional regulatorv regions dern ed from 
FAEI genes. The transcnptional regulatorv regions of the invention may be useful in 
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promoiing early seed-speciilc transcnpnon oi heterologous sequences to which they arc 
operably linked. The transcnpdonai regulatory regions of the invention may be used in a wide 
variety of plants, including Brassica sp.. Arabidopsis and other plant species. DNA constructs 
compnsing the transcriptional regulator}' sequences of the invention may be active during 
fatty acid or lipid biosynthesis in the plant embryo. Certain embodiments of the constructs of 
the invention may be used in transgenic plants to promote expression of heterologous 
sequences m developing seeds. In vanous embodiments, the constructs of the invention may 
be used to mediate gene expression that affects seed lipid metabolism, or seed protein 
composition or seed carbohydrate composition, or seed development. In altcmauve 
embodiments, the transcriptional regulator)' regions of the invention may also be useful for the 
production of modified seeds containing novel recombinant proteins which have 
pharmaceutical, industrial or nutritional value. 

BRIEF DESCRIPTION OF THE DRAWINGS 

kigure I shows a 934 bp DKA sequence comprising the Arahidopsis thaliana FAEl 
transcription regulatory sequence. 

Figure 2 shows a 1588 bp DNA sequence compnsing the Brassica napus FAEl 
transcription regulatory sequence. 

Figure 3 shows a 1069 bp DNA sequence compnsing the Lunaria annua FAEl 
transcription regulatory sequence. 

Figure 4^shows an alignment of the Arabidopsis thaliana {A.t.). Lunana annua {L.a. \ 
and Brassica napus (B.n. ) iranscnption regulatory sequences. .Asterisks below the sequences 
indicate identical nucleotides in each of the three sequences. .\ number of putam e cis-acting 
sequence motifs arc idemiiied m the A. thaliana sequence: an EMI ABA box at -44bp to - 
36bp ha\'ing the sequence ACATCTCAT, for which the published consensus sequence is 
ACGTGTCAT (Rowley. D.L and Herman. E.M. (1997), Biochimica et Biophysica Acta 
1345:1-4); an A-300 box at -51 bp to -46bp having the sequence TGCAAT. for which the 
published consensus sequence is TG(T/A C1AAA(G Ti (Morton et al. ( 1994) in Seed 
Development and Germination (KigeL J. and Galliii. G., eds.) pp. 103-1 ?S. Marcel Dekker. 
New \\:>rk); G-bo\ 1 at -105 to -100 bp having the sequence CACATG, for which is the 
consensus sequence is CACCTG, and Q-box 2 at -164 to -159 bp having the sequence 
C.AACTT- for which the consensus sequence is CAACTG (Kawogoe. V. and Murai. N. 
( 1992) Plant J. ::9:'7-936. CEI element at v::6 to -218 bp having ihe sequence 
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AcCATCGA. for which the consensus sequence is TGCCACCGG. and a CE3 eiement at - 
38lbp to -369 bp having the sequence ACACATTCCCTC, for which the consensus sequence 
is ACGCGTGTCCTC (Shen ei al., (1996) Plant Cell 8:1 107- 111 9). Not highlighted is a 
putative RY repeat motif at -53bp to -47bp having the sequence CATGCAA, for which the 
consensus sequence is CATGCAT (Dickinson et al. (1988) Nucleic Acid Res. 16:371; 
Lelievrc et al. (1992) Plant Physiol. 98:387-391). Also shown, as Con. 4. is a consensus 
sequence, wherein R^G or A. Y^T/TJ or C, M=A or C. K=G or TfU. S=G or C. \V=A or T.L\ 
B=G or C or T/U. D=A or G or T;r , H=A or C or T/L\ V=A or G or C and N=A or G or C or 

Figure 5 shows an alignment of the Arabidopsis thaliana (A.t.) and Lunaria annua 
(Lm.) transcription reguiator\' sequences. Asterisks below the sequences indicate identical 
nucleotides in each of the two sequences. The base at position -400 in the A J. sequence is 
highlighted. The alignment of sequences in both Figure 4 and Figure 5 was accomplished 
using the CLUSTALW program (version 1.74) for multiple sequence alignments, using a gap 
open penalty of 15. a gap extension penalty of 6.66 and an lUB DNA weight matrix. Also 
shown, as Con. 5, is a consensus sequence, wherein R^G or A. Y=T''U or C. M=A or C. K^G 
or TV. S=G or C, W=A or TV^ B-G or C or T/V, D-A or G or TV, H=A or C or T/TJ. V=A 
or G or C and N=A or G or C df TAJ. 

Figure 6 includes two bar graphs illustrating hydroxy fatty acid content of A) FAEI- 
FAH12 and B) napin-FAHI2 transgenic seeds, expressed as percentage of total seed fatty 
acids. 

Figure 7 shows an alignment of the Brassica nanus ( B.n. ) and Lunaria annua ( L.a. ) 
FEAl transcription regulatory sequences. .Asterisks beiow the sequences mdicate identical 
nucleotides in each of the two sequences. 

Figure 8 shows an alignment of the Brassica napus [B.n. ) and Arabidopsis ihaiiana 
[A J.) FEAl transcription regulator)' sequences. Asterisks below the sequences mdicale 
identical nucleotides in each of the two sequences. 

DETAILED DESCRIPTION OF THE INVENTION 

The recombinani nucleic acid molecules of the m\'ention may comprise a heterologous 
promoter sequence operably linked to a nucleic acid sequence, wherein the promoter 
sequence comprises a transcriptional regulatory region capable of mediating seed-specific 
expression in Arabidopsis. The transcriptional regulatory region may be obtainable from a 
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plant FAEl gene. Alterntively. The transcriptional regulatory region may hybridize under 
stringent conditions to a 5' region of the plant FAEl gene. In funher alternative 
embodiments. The transcriptional regulatory region may be at least 70^ identical when 
optimally aligned to the 5' region of the plant FAEl gene. 
5 In alternative embodiments, the invention provides isolated nucleic acids comprising 

the transcriptional regulatory regions of the invention. By isolated, it is meant that the 
isolated substance has been substantially separated or punfied away from other biological 
components with which it would otherwise be associated, for example in vivo. The term 
'isolated' therefore includes substances purified by standard punfication methods, as well as 
10 substances prepared by recombinant expression in a host, as well as chemically synthesized 
substances. 

In the context of the present invention, "transcriptional regulatory region" means a 
nucleotide sequence capable of mediating or modulating transcription of a nucleotide 
sequence of interest, when the transcriptional regulatory region is operably linked to the 

! 5 sequence of interest. Conversely, a transcriptional regulatory region and a sequence of interest 
are "operably linked" when the sequences are functionally connected so as to permit 
transcription of the sequence of interest to be mediated or modulated by the transcriptional 
regulatory region. In some embodiments, to be operably linked, a transcriptional regulatory 
region may be located on the same strand as the sequence of interest. The transcriptional 

20 regulatory region may in some embodiments be located 5' of the sequence of interest. In such 
embodiments, the transcriptional regulator}' region may be directly 5' of the sequence of 
interest or there may be interx'ening sequences between these regions. Tlie operable linkage of 
the transcriptional regulatory region and the sequence of interest may require appropriate 
molecules (such as transcriptional activator proteins) to be bound to the transcriptional 

25 regulator.' region, the invention thcretbrc encompasses embodim.ents in which such molecules 
are provided, either in vara or in vivo. 

The term "recombinant" means that something has been reeombined, so that when 
made m reference to a nucleic acid molecule the term refers to a molecule that is comprised ot 
nucleic acid sequences that are joined together h\' means of molecular biological techniques. 

^^0 The term "recombinant ' when made in relerence to a protein or a polypeptide reters to a 
protein molecule which is expressed using a recombinant nucleic acid molecule. The term 
"heterologous" when made in reference to a nucleic acid sequence refers to a nucleotide 
sequence which is ligated to. or is manipulated to become ligated to. a nucleic acid sequence 

- 4 - 
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to which it IS not Hgated m nature, or to which it is iigated at a different location m nature. The 
term "heterologous" therefore indicates that the nucleic acid molecule has been manipulated 
using genetic engineering, i.e. by human inter\'ention. 

Sequences may be denved or obtainable from plant FAEI genes by deduction and 
5 synthesis based upon the wild-type FAEI gene sequences. Derived sequences may be 

identiiled in different organisms, for example by isolation using as probes the nucleic acid 
sequences of the invention. Alternative transcriptional regulatory regions may be derived 
through mutagenesis or substitution of wild-type sequences, such as the sequence disclosed 
herein. Derived nucleic acids of the mvention may be obtained by chemical synthesis, 

1 0 isolation, or cloning from genomic DNAs using techniques known in the an. such as the 

Polymerase Chain Reaction (PCR). Consensus sequences, such as those illustrated in Figures 
4 and 5 are alternative embodiments of the nucleic acids of the invention, derived from the 
disclose wild-type FAEI gene sequences. Nucleic acids of the present invention may be used 
to design alternative pnmers (probes) suitable for use as PCR primers to amplify panicular 

15 regions of an FAEI gene. Such PCR primers may for example comprise a sequence of 15-20 
consecunve nucleotides of the sequences of the invention. To enhance amplification 
specificity, primers of 20-30 nucleotides in length may also be used. Methods and conditions 
for PCR amplification are described in Innis et al. (1990); Sambrook et al. (1989); and 
Ausubel et al. (1995). As used herein, the term "probe" when made in reference to an 

20 oligonucleotide refers to an oligonucleotide which is capable of hybndizing to another 

oHgonucleotide of interest. A probe may be single-stranded or double-stranded. Probes arc. for 
example, useful in the detection, identilkation, amplification and isolation of panicular gene 
sequences. Oligonucleotide prooes may be labelled with a "reponer molecule." so that the 
probe is detectable using a detection system, such as enzymatic, tluorescent. radioactive or 

25 luminescent detection systems. 

Denved nucleic acids of the invention ma>' also be identified by hybridization, such as 
Southern or Northern analysis. Southern analysis is a method b>' which the presence of DNA 
sequences in a target nucleic acid mixture are identified bv hybridization to a labeled probe, 
comprising an oligonucleotide or DN.A fragment of a nucleic acid of the invention. Probes tor 

?0 Southern anaivsis may for example be at least 15 nucleotides m length. Southern -nalvsis 

typically involves electrophoretic separation of DNA digests on agarose gels, denaturation of 
the DNA after electrophoretic separation, and transfer of the DNA to nitrocellulose, nylon, or 
another suitable membrane support for analysis with a radiolabeled, biotinyiated. or enzyme- 
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labeled probe as descnbed in Sambrook et ai ( 1989). Similarly. Northern analysis may be 
used to identif>' RNAs thai hybridize to a knowTi probe such as an oligonucleotide. DNA 
fragment, cDNA or fragment thereof, or RNA fragment of a nucleic acid of the invention or a 
known FAEl sequence. The probe may be labeled with a radioisotope such as ^*'P, by 
5 biotinylaiion or with an enzyme. Tne RNA to be analyzed may be electrophoretically 

separated on an agarose or polyacr\'lamide gel, transferred to nitrocellulose, nylon, or other 
suitable membrane, and hybridized with the probe, using standard techniques well known m 
the art such as descnbed in Sambrook et ai (1989). 

In alternative embodiments, a transcriptional regulatory region of the invention may 

10 be at least 70% identical when optimally aligned to the 5' region of a plant FAEl gene, such 
as the Arahidopsis FAEl gene. In alternative embodiments, the degree of identity may be 
between 50% and 100%, such as 60%, 80%. 90%. 95% or 99%. WTien a position in the 
compared sequence is occupied by the same nucleotide or amino acid, following optimal 
aligrmient of the sequences, the molecules are considered to have identity at that position. The 

15 degree of identity between sequences is a function of the number of matching positions shared 
by the sequences. In terms of percentage, identity is the sum of identical positions, divided by 
the total length over which the sequences are aligned, multiplied by 100. 

Various aspects of the present invention encompass nucleic acid or amino acid 
sequences that are homologous to other sequences. As the term is used herein, an amino acid 

20 or nucleic acid sequence is "homologous" to another sequence if the two sequences are 

substantially identical and the functional activity of the sequences is conser\'ed ( for example, 
both sequences function as or encode a FAEl enzyme: as used herein, the term homologous 
does not infer e\'olutionary rciatcdness ). Nucleic acid sequences may also be homologous if 
they encode substantially identical amino acid sequences, even if the nucleic acid sequences 

25 are not themselves substantially identical, a circumstance that may for example arise as a 
result of the degeneracy of the genetic code. 

Two amino acid or nucleic acid sequences are considered substantiaih' identical if 
when opnmalh' aligned (with gaps pemiitted). the> share at least about 50° o sequence 
similant\ or identity, or if the seuuences share detlned functional motifs. In altemam e 

30 embodiments, sequence similarity in optimalh- aligned substantiallv identical sequences ma\ 
be at least 60" o. TO^'^o. 80° o. 90? o or 95° o. As used herein, a given percentage of homology 
between sequences denotes the degree of sequence identity in optimally aligned sequences. 

- 6 - 
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Optimal aiigmnent of sequences for compansons of simiianty may be automated usmg 
a vanety of algorithms, such as the local homology algonthm of Smith and Waterman (1981) 
Adv. Appl. Math 2: 482, the homology aiignment algonthm of Needleman and Wunsch (1970) 
J. Mol. Biol. 48:443. the search for similarity method of Pearson and Lipman ( 1988) Proc. 
5 Natl. Acad. Sci. USA 85: 2444. and the computenzed implementations of these algorithms 
{such as GAP. BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package. 
Genencs Computer Group, Madison, WI. U.S.A. ). Sequence similarity may also be 
determined using the BLAST algonthm. described in Altschul et ai (1990), J. Mol. Biol. 
215:403-10 (using the published default settings). Software and instructions for performing 

10 BLAST analysis may be available through the National Center for Biotechnology Information 
in the United States (including the programs BLASTP, BLASTN, BLASTX, TBLASTN and 
TBLASTX that may be available through the internet at hup://vvwvv.ncbi.nim.nih.GOvO . The 
BLAST algonthm involves tlrst identifying high scoring sequence pairs (lISPs) by identifying 
shon words of length W in the query sequence that either match or satisfy some positive- 

1 5 valued threshold score T when aligned with a word of the same length in a database 
(reference) sequence. T is referred to as the neighborhood word score threshold. Initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs. The word hits 
are extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Extension of the w^ord hits in each direction is halted when the 

20 following parameters are met: the cumulative alignment score falls off by the quantity X from 
its maximum achieved value; the cumulative score goes to zero or below, due to the 
accumulation of one or more negative-sconng residue alignments: or the end of either 
sequence is reached. The BLAST algonthm parameters W. T and X detennine the sensitivit) 
and speed of the alignment. The BLAST program may use as defaults a word length iW) of 

25 1 1, the BLOSUM62 sconng matnx (Henikoff and Henikoff (1992) Proc. SatL Acad Sci. i'SA 
89: 10915-1091 9). a gap existence cost of 1 1, a per residue gap cost of I . a lambda ratio of 
0.85, alignments i B) of 50. expectation (E) of 10, M=5. N=4, and a companson of both 
strands. One measure of the statistical similarity between two sequences using the BLAST 
algorithm is the smallest sum probability (P(N) ). which provides an indication of the 

J^O probability by which a match between two nucleotide or amino acid sequences wc^uld occur 
by chance. In alternative embodiments of the invention, nucleotide or amino acid sequences 
are considered substantially identical if the smallest sum probabilitv in a comparison of the 
test sequences is less than about L preferably less limn about 0.1, more preferably less than 
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about 0.01, and most preferably less ihan about 0.001. In the PSI-BLAST implementation of 
the BLAST algorithm, an expect value for inclusion in PSI-BLAST iteration may be 0.001 
(Altschul et al. (1997), Nucleic Acids Res. 25:3389-3402). Searching parameters may be 
varied to obtain potentially homologous sequences from database searches. 
5 An alternative indication that two nucleic acid sequences are substantial!) identical is 

that the two sequences hybridize to each other under moderately stringent, or preferably 
stnngcnt, conditions. Hybndization to filter-bound sequences under moderately strmgent 
conditions may. for example, be performed in 0.5 M NaHP04, 7% sodium dodecyl sulfate 
(SDS). 1 mM EDTA at 65EC. and vvashmg m 0.2 \ SSC/0.1% SDS at 42EC (see AusubeL ci 

10 ill. (eds), 1989. Current Protocols in Molecular Biolog\\ Vol. 1. Green Publishing Associates. 
Inc., and John Wiley &. Sons. Inc.. New York, at p. 2.10.3). Alternatively, hybridization to 
filter-bound sequences under stnngent conditions may, for example, be performed in 0.5 M 
NaHPOa, 7% SDS, 1 mM EDTA at 65EC. and washing in OT x SSC/OT% SDS at ()8EC {see 
AusubeL ei al. (eds). 1989, supra). Hybridization conditions may be modified in accordance 

1 5 with known methods depending on the sequence of interest (see Tiissen. 1993, Laboratory 
Techniques in Biochemistry and Molecular Biology - Hybridization with Xucleic Acid 
Probes. Part I, Chapter 2 "Overview of principles of hybridization and the strateg}' of nucleic 
acid probe assays", Elsevier. New York). Generally, stnngent conditions are selected to be 
about SEC lower than the thermal melting point for the specific sequence at a defined ionic 

20 strength and pH. 

A FAEI promoter is any naturalh' occurring transcriptional regulatory region that 
mediates or modulates the expression of a plant FAEI condensing enzyme. Plant FAEI 
condensing enzymes arc proteins that are homologous to known FAEI condensing enzymes, 
such as those cloned and described in International Patent Publication W'O 96/13582. 

25 Heterologous DNA sequences may for example be introduced into a host cell by 

transformation. Such heterologous molecules may include sequences derived trom the host 
cell species, which have been isolated and reintroduced into ceils of the host species. 
Heterologous nucleic acid sequences may become integrated into a host cell genome, either as 
a result of the original transformation of the host cells, or as the result of subsequent 

30 recombination events. Tr:mstormation techniques that may be employed mclude plant ceil 
membrane disruption by electroporation, microinjection and polyethylene glycol based 
transtbrmaiion (such as are disclosed in Paszkowski t7 a/. EMBO J. 3:2717(1984): Fromm ci 
al., Proc. Sutl. Acad Sci. USA 82:5824 ( 1985): Rogers et al.. Mefhoas Enz\'mot. i 18:627 
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(1986); and in U.S. Patent Nos. 4.6S4.61 1; 4.801.540: 4.743,548 and 5.231,019), bioiistic 
transformation such as DNA panicle bombardment (for example as disclosed in Klein, ei ai. 
Nature 327: 70 (1987); Gordon-Kamm. et ai ''The Plant CelF' 2:603 (1990); and in U.S. 
Patent Nos. 4,945,050; 5,015,580; 5,149,655 and 5,466,587); .-Igraft^jcvmz/m-mediated 
transfomiation methods (such as those disclosed in Horsch e( ai Science 233: 496 (1984); 
Fraley et ai, Proc. Sat 'I Acad. Sci. USA 80:4803 (1983); and U.S. Patent Nos. 4,940.838 and 
5,464.7631. 

Standard methods are available for the preparation of constructs for use in identifying 
and characterizing transcriptional regulatory' regions useful in vanous embodiments of the 
invention. General molecular techniques may for example be performed by procedures 
generally described by Ausubel FM, Brent R. Kingston RE. Moore DD. Seidman JG. Smith 
JA. Stuhl K. (1995) Current Protocols in Molecular Biology, Vols 1. 2 and 3. Alternative 
equivalent methods or variations thereof may be used in accordance with the general 
knowledge of those skilled in this an and the functional requirements of the present invention. 

In some aspects of the mvention, transformed plant cells may be cultured to regenerate 
whole plants having a transformed genotype and displaymg a desired phcnorype, as for 
example modified by the expression of a heterologous protein mediated by a transcriptional 
regulatory region of the invention. A vanety of plant culture techniques may be used to 
regenerate whole plants, such as are described in Gamborg and Phillips, "Plant Cell, Tissue 
and Organ Culture. Fundamental Methods", Springer Berlin, 1995); Evans ci ai ''Protoplasts 
Isolation and Culture". Handbook of Plant Cell Culture. Macmillian Publishing Company. 
New \'ork. 1*^83: or Binding, "Regeneration of Plants. Plant Protoplasts", CRC Press. Boca 
Raton. 1^85; or m Klee ct ai, Ann. Rev. of Plant Phys. 38:467 (1987). A cell, tissue, organ, or 
organism uuo which has been introduced a foreign nucleic acid, is considered "transformed", 
•"translected". or ■"transgenic". A transgenic or transformed cell or organism also includes 
progen\ of the cell or organism and progeny produced from a breeding program employ mg a 
transgenic plant as a parent in a cross and exhibiting an altered phcnotvpe resulting from the 
presence of a recombinant nucleic acid construct. A transgenic plant is iheretorc a plant that 
has been transformed with a heterologous nucleic acid, or the progeny of such a plant that 
includes the transgene. The invention provides vectors, such as vectors for transtorming plants 
or plant cells. The term "vector" in reference to nucleic acid molecule generally refers to a 
molecule that may be used to transfer a nucleic acid segment! s) from one cell to another. One 
of skill will recognize that after the nucleic acid :s stably mcorporatcd in transgenic plants and 
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conilrmed to be operable, it can be introduced into other plants by sexual crossing. .^Vny or a 
number of standard breeding techniques may be used, depending upon the species to be 
crossed. 

In various embodiments, the invention comprises plants transformed with the nucleic 
5 acids of the invention. In some embodiments, such plants will exhibit altered fatty acid 
content in one or more tissues. These aspects of the invention relate to all higher plants, 
includmg monocots and dicots. such as species from the genera Fragana. Lotus, Medicagu^ 
Onobrychis. Triforium, Trigonelia, Wgna, Citrus, Linum. Geranium, Manihot, Caucus. 
Arabidopsis, Brassica, Raphanus, Sinapis. Atropa. Capsicum, Hyoscyamus, Lycopersicon. 

10 Sicotiana, Solanum. Petunia Digitalis, Majorana, Cichonum, Helianthus, Laciuca, Bromus, 
Asparagus, Antirrhinum, Heterocatlis, Nemesia, Pelargonium, Panicum, Penniserum. 
Ranunculus, Senecio, Salpiglossis, Cucarnis, Browallia, Glycine, Lolium, Zea, Tnticum. 
Sorghum, and Datura. Such plants may include maize, wheal, rice, barley, soybean, beans, 
rapeseed, canola. alfalfa, flax, sunflower, cotton, clover, lettuce, tomato cucurbits, potato 

15 carrot, radish, pea lentils, cabbage, broccoH, brussel sprouts, peppers, apple, pear, peach. 

apricot, carnations and roses. More specifically, in alternative embodiments, plants tor which 
the invention may be used in modifying fatty acid content include oil crops of the Cruciferae 
family: canola, rapeseed (Brassica spp.), crambe (Crambe spp.), honesty (Lunana spp.) 
lesquerella (Lesquerela spp.), and others; the Composirae family: sunflower [Helianthus 

20 spp.), safflower [Carthamus spp.), niger (Guizotia spp.) and others: the Palmae family: palm 
[Elaeis spp.), coconut iCocos spp.) and others; the Leguminosae family: peanut {Arachis 
spp.). soybean [Glycine spp.) and others: and plants of other families such as maize [Zca spp.i, 
cotton {iiossvpiun sp.), jojoba [Simonasia sp.), flax {Linum sp.), sesame iSesamum spp.), 
castor bean [Ricinus spp.), olive [Olea spp.), poppy [Papaver spp.), spurge [Euphorbia, spp. ), 

25 meadowfoam [Limnanthes spp.), mustard (Sinapis spp.) and cuphea {Cuphea spp.). 

Nucleic acids of the invention may also be used as a plant breeding tool, as molecular 
markers to aid in plant breeding programs. Such techniques would include using the gene 
itself as a molecular probe or using the DN.A sequence to design PCR primers :o use PCR 
based screening techniques in plant breeding programs. 

30 Deletion or insertion constructs may be usetui for domain mapping lo determine the 

functional domains or motifs of a transcriptional regulatory region denved from a FAR! gene. 
.\n aspect of the invention is the construction and testing of such constructs, as described 
below for the 5' deletion construe: of the A. thaliana FAEI 5' region. One aspect of the 
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invention compnscs iranscnptionai reguiatorv^ regions that are derived trom runcnonaiiy 
important regions oiaFAEl promoter. As outlined above, the functionally imponant regions 
of slFAEI promoter may be determmed through routme assays. Alternatively, randomly 
selected portions of a a FAEI promoter may be selected for use in routine assays to determme 
5 whether the selected region is capable of functioning as a transcriptional regulatory region in 
the context of the present invention. In various embodiments, regions of the Arabidopsis 
thaliana. Brassica napus or Lunaria annua promoters may be used. For example, the 
following motifs in the A.i. FAEI promoter may be used alone or in combination in novel 
transcriptional regulatory regions (see Figure 4): the CE-like elements (CEl and CE3), the R^' 

10 repeat motif, the G-boxes (G-boxl and G-box2), the A-300 box, the EMI ABA box. or the 
CTATTTTG element. Constructs of the invention compnsing such motifs, deletions or 
insertions may be assayed for activity as transcriptional regulator}^ regions of the invention by 
tesdng for strong seed-specific activity providing expression of a sequence of interest (such as 
a reporter sequence) before the torpedo stage and persisting throughout embryo development. 

15 in accordance with standard testing methods that may be adapted from the methods disclosed 



Alternative embodiments of the transcriptional regulatory regions of the invention may 
be identified using information available through NCBI databases at http://www.ncbi.nih.gov. 
In various embodiments, transcriptional regulator)' regions derived from plant FAEI 
20 genes are shown to be capable of directing expression of desired genes at an early stage o: 
development in a seed-specitlc manner m disparate plant species. In particular embodimen:s. 
the transcriptional regulatory regions of the invention may be used in a wide \'arietv o: 
dicotyledonous plants for moditlcation of the seed phcnotypc. For example, new sccl: 
phenotypes may include: 
25 (1) altered seed fatty acid composition or seed oil composition and accumulation 

(2) altered seed protein or carbohydrate composition or accumulation 

(3) enhanced production of desirable endogenous seed products 

(4) suppression of production of undesirable gene products using aniisense. co-suppression or 



30 15) production of novel recombinaiu proteins for pharmaceutical, industrial or nutritional 



herein. 



ribozyme technologies 



purposes 



Isolation of a seed-specific promoter from A thaluma 
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Vsing the sequence informaiion of the A. ihaliana genome sequencing projeci. 
synthetic oiigonucietide pnmers were designed to ampHfy the FAEl 5' untranslated region, to 
isolate it by PGR. As shown in Figure 1, the upstream primer 5'- 
CTAGTAGATTGGTTGGTTGGTTTCC-3' (AtproFW) in combination with the downstream 
5 pnmer 5*-TGCTCTGTTTGTGTCGGA.A.AAT.A.\TGG-V (AtproRV) were used, and resulted 
in the synthesis of a fragment of the correct size (934 bp). The amplified product was 
subcloned in the Hindi site of the plasmid pT7T3-I8U (Pharmacia) to produce plasmid 
pT7T3-18LVpiroFAE900, follow^ed by complete sequence determmaiion of both strands to 
venf\' the fragment identity. A BLAST search of the A. (haliana Database identified a single 
10 BAG clone T4L20 (GenBank ATF10M6) 125,179 bp long, which contains the complete 
FAEl^Qxis,. 

Functional analysis of the FAEl 5' upstream region 

5' upstream fragments of the FAEl gene were showTi to confer seed-specilic and 
temporally regulated gene expression in plants. A translational fusion was made between the 

15 FAEl 5' region and the coding region of the reporter gene P-glucuronidase (GUSV The 
chimeric gene (pFAE900-GUS or pFAE400-GUS) was transferred into Arabidopsis and 
tobacco and GUS activity' was monitored in various tissue of transgenic plants. 

(^instruction of the vectors pFAE900-GUS and pFAE400-GUS. and transformation of 
Arabidopsh and tobacco, w^as as follows. The insert w^as cleaved out of pT7T3-18U vector 

20 with HindlW and Xba\ and direciionally subcloned into the corresponding sites of the binary 
Ti plasmid pBIlOl iClontech), which contains a promotcrless GUS gene (Jefferson et al. 
1987). to obtain the vector pFAE900-GljS. Another construct. pFAE400-GUS. contaming 
only 3^-^3 bp of the 5* FAEl region directl>' upstream of the ATG inuiation codon rused to the 
GUS coding sequence was also generated. For that, the pT7T3- 1 8U/ proF.\E900 \ cctor was 

25 digested with D^llI and PstL the sticky ends were filled in using r4 DNA poivmerase, 
followed by re-ligation to obtain pT7T3-l 8U/proFAE400. The 393 bp 5" FAEl upstream 
fragment was then excised with Hindlll and AT^al and cloned into the binary \ ector pBIlOl to 
obtain the plasmid pFAE40Q-GUS. The pFAE400-GUS ana pFAE900-GUS fusion constructs 
in pBIlOl were mtroduced into Agrohacierium lumeraciem strain G\'3101 (Koncz and 

30 Scheli. i^^86) by heat-shock and selected for resistance to kanamycin (50 ug ml). A thalian.i 
(L.) Heynh. ecotype Columbia was transformed with the pFAE400-GUS and pFAE900-GUS 
constructs using tloral dip method (^Clough and Bent, 1998). Screening for transformed seed 
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was done on SOug/mL kanamycin as described previously (Katavic et al.. 1994). 
Approximately 100 transgenic lines were generated for each construct. 

For transformation of tobacco, A. tumefaciens harbouring the pFAE900-GUS construct 
was co-cultivated with leaf pieces oi Nicotiana lahacum SRI and transformants were selected 
5 with kanamycin ( lOOiig'mL) on solid medium ( Lee and Douglas, 1996). 

Histochemical localization of GUS activity and analysis of transgenic plants was as 
follows. Tissue sections were placed in 100 mM NaPOa (pH7) and i mM spermidine for 15 

min, then mcubated at 37^ C in 0.5 K3[Fe{CN)6], 0.01 Triton X-100, ImM EDTA, 10 mM 
P-mercaptocthanoL 5-bromo-4-chloro-3-indolyl-P-D-glucuronide in 100 mM NaPOa (pH7). 

10 until a blue color appeared (after approximately 1 hr). Following incubation with the substrate, 
chlorophyll was removed from the sections using a graded ethanol series. 

Using this assay, five independent transgenic Arabidopsis lines were examined for the 
embr>'o-specific expression of the GUS gene. In addition, leaf, stem and siliques were 
hisiochemically stamed for P-glucuronidasc activity. The results indicate that the reporter 

15 gene fused to the transcripuonal regulatory region of the invention is not expressed in 
vegetative tissues, whereas it is highly expressed in developing seeds (embryos). Both the 934 
bp and the 393 bp transcriptional regulator}^ regions derived from the Aa. FAEl gene caused 
the appearance of GUS activity by the torpedo stage embryo (6 days after flowermg). GUS 
activity in all five lines persisted throughout subsequent embr>'0 development. 

20 Leaves, stems, pods and seeds of three regenerated tobacco lines transformed with the 

pFAE^OO-GUS construct were also assayed for P-glucuronidase activity. The results obtained 
mdicatc that the 934 bp FAEl promoter tragment contains sulTicicnt information to direct 
seed-spccilk expression of a reponcr gene in transgenic tobacco. Thus the transcriptional 
regulatory regions of the invention may be used for seed-spcciric expression of foreign genes 

25 m transgenic plants. 

The in vivo acti\'ity of a FAE! promoter of the invention was compared to the activity 
of the napin promoter by expressing the casror bean hydroxylase gene FAH12 (Broun and 
SomerMlie. 1997) behind either the f J£"/-promoter ;a transcriptional rcgulatorN region o{ 
anproximatei>' 1 kbi or the napin promoter m an Arabidopsis Jdd2.fuci double rnutanr. Thas 

30 mutant accumulates as a proportion of fatty acids about 85° o of the 18:1 acyi group, which is 
the substrate for the hydroxylase. The le\els of hydroxylated fatty acids accumulating m a 
large number of independent transgenic lines were used to estimate the relative strength of 
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each promoter. As shown m Figure o. the two populations of transgenic plants accumulated 
levels of hydroxylated fatty acids, rangmg from 0.2% to about iI-12% of total fatty acids, 
with the levels being on average slightly higher in FAE1-FAH12 lines. Similarly, the best 
FAE1-FAH12 plant accumulated just over \2% of hydroxylated fauy acids (w;vv of total FAs), 
5 whereas the best napin'FAHI2 plant produced 10.8% of hydroxylated fatty acids (wvw* of total 
FAs). These results indicate that the FAEl promoter is highly active in transgenic Arabidopsis 
and that its in vivo activity may be superior to napm in Arabidopsis seeds. 

Sequence elements or motifs that confer both tissue specificity and developmental 
regulation of transcription reside within 393 bp of the AUG translation initiation codon in the 
10 A J. FAEl gene. The seed-specific expression conferred by the transcriptional regulatory 
regions of the invention is independent of the native terminator of the FAEl gene 3' end. For 
example, in the exemplified constructs disclosed herein, a terminator denved from the 
Agrobactcrium nopaline synthase gene was used. 



charactenzed to demonstrate that regulatory regions conferring seed-specific transcription 
early in embr>'0 development can also be found upstream of other plant FAEl genes. 
Sequences were cloned using the technique of polymerase chain reaction (PCR) walking on 
uncloned plant genomic DNA (Devic et al., 1997). Approximately 5 \xg of genomic DNA 

20 trom 1 g of fresh tissue was used for the construction of 5 different libraries by digesting DNA 
with a series of enzymes that produce blunt end fragments to which special adaptors are 
ligated. Tlie adaptor molecules consist of a long upper strand, which contains successi\e 
sequences common to the adaptor primers. API and AP2. annealed at its 3' end to a shorter 
strand lacking the \?\ sequence. However, this short strand posseses an amine group at its 3* 

25 end to prevent filling m by the DNA polymerses during the first PGR amplification step and 
generation of the API binding site. This suppression PGR effect prevents exponential 
amplification of molecules containing the adaptor at each end. and the adaptor primer binding 
sites are only produced when a strand complementar\' to the upper strand of the adaptor is 
synthesized by extension from a gene specific pnmer. The first PGR reaction is performed 

30 using an adaptor pnmer API and a gene specific pnmer. An aliquot of the first PGR product is 
used a template in a second PGR amplification using the nested gene specific pnmer and AP2. 

In order to isolate the regulator}' regions upstream of the B. napus FAEl coding 
sequence, genomic DNA was prepared from developing leaves and digested with f blunt-enu 



Lunaria annua and Brassica nanus FAEl 5'regulatorv reeions 



15 



Two sequences originating from B. napus and L. annua were isolated and 
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cittting resmciion enzymes (DraL EcoRK HpaL PvuII and Seal) to generate a series of DN A 
libraries. After ligation of adapter molecules, individual libraries were used as templates m a 
tu'o step PCR. In the first PCR amplificauon using the API pnmer 5'- 
GGATCCT.A.ATACGACTCACTATAGGGCo^ and the FAEl gene specific pnmer 5'- 
5 .AAAGAGTGGAGCGATGGTTATGAGG-3' (Bnwalkl), multiple DNA fragments were 
amplified from all five librar>' templates. After a second round of PCR. using the AP2 pnmer 
5"-CTATAGGGCTCGAGCGGC-3' and the nested FAEI specific pnmer 5"- 
CGG/VA7\G.AAGC.AA.AGGTTGA.AA.AGG-3^ (Bnwalk21. the longest single fragment of 1.6 
kb was obtained from the Hpal librar>' template. Tliis fragment was insened into the pCRZ.l 

10 plasmid (Invitrogen) and sequenced. The sequence is shown in Figure 2. 

For the PCR walking experiment to isolate the L. annua 5' regulatory region, in 
adition to the standard API and AP2 pnmers, the following FAEI specific primers were used: 
5'-GATCGTTTGTGGTAAG ACGAGAGC-3 ' ( Lawalk 1) and 5" - 

GTCAGTGGGAAG.AAACAGAGGTTG-3' {Lawalk2). In the first PCR reaction, the Dral, 

15 EcoRW Pvull Seal and Ssp! libran^ templates were used. In a second PCR amplification the 
longest single fragment 1.1 kb in length was synthesized using the EcoRV library template. 
This fragment was inserted into the Hindi site of the pT7T3-18U vector (Promega), 
sequenced on both strands and analyzed ( Figure 3). 

Using the sequence data obtained for the 5' regulatory regions generated by PCR 

20 walking, specific pnmers were generated for the amplification of the L. annua and B. napus 
FAEI promoter fragments. For the PCR-amplification of D. napus promoter fragment the 
upstream pnmer was 5'-CTGACTTCACCAAAGAAACAACTCG-3' (BnproF\\') in 
combmation u ith the downstream primer 5'-CGGAA rTCCGTTTTTTTTTTTAGGCG-3' 
(BnproR\'). The synthesized fragment was Heated into the Smal site of pGEM-'Zf 

25 (Promega), then excised wnh Xhal BamHI and cloned into the equivalent sites of the pBIlOl 
binaPv' vector (Clontech). L. annua 5* regulatory region was amplified using the ^'- 
CAGCTT.A/\CCGGTAAAArTGGCC-3' (LaproFW) upstream pnmer together with the 5'- 
TGTTCAGTTTTGTGTCGG.\GAGG-3' (LaproRVl downstream pnmer and insened in ihe 
Hind! site of pT"T3-18L' (Promega) plasmid. In order to clone the L. annua promoter 

30 iragment into the pBIlOl binary vector, an .Vbal site wag added b> subcloning the PstI Kpnl 
fragment released from the pT7T3-lSU vector into pBFuescnpt II KS^ (Stratagene). The 
fragment was then excised and cloned in the Xbal site of the pBIlOl vector. 
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The resulting vectors pBriFAEl-GUS and pLaFAEl-GUS in pBIlOl were then 
introduced into A. tumefaciens strain GV3101 by heat-shock, and used to transform 
Arabidopsis as described above. Transformants were selected on agar-solidified medium 
containing kanamycin (50 p.g/ml). More than 100 transformants were generated for each 
construct. The activity of the L. annua and B. napus FAEl promoters was determined by GUS 
expression assays on the developing seeds and also on non-reproductive plant tissues as 
controls. Consistent seed-specirlc GUS expression was obtained for both promoter constructs 
in independent transgenic lines. In contrast, there was no detectable GUS activity in leaf, stem 
and silique samples. 
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